Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 91047 |
| Missing cells | 173711 |
| Missing cells (%) | 7.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 18.8 MiB |
| Average record size in memory | 216.0 B |
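The missing-cell percentage follows from the counts in the table above. A minimal sketch of that arithmetic, with illustrative variable names that do not come from the report:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 27
n_observations = 91047
missing_cells = 173711

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations

# Share of all cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```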
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 15 |
| Unsupported | 4 |
Warnings
| Warning | Type |
|---|---|
| tipo_persona has constant value "Natural" | Constant |
| entidad has a high cardinality: 151 distinct values | High cardinality |
| tipo_persona is highly correlated with genero and 12 other fields | High correlation |
| genero is highly correlated with tipo_persona | High correlation |
| tiene_casa_propia is highly correlated with tipo_persona | High correlation |
| estado_civil is highly correlated with tipo_persona | High correlation |
| codeudor is highly correlated with tipo_persona | High correlation |
| municipio_expedicion is highly correlated with tipo_persona and 2 other fields | High correlation |
| tipo_identificacion is highly correlated with tipo_persona | High correlation |
| forma_pago is highly correlated with tipo_persona | High correlation |
| municipio_nacimiento is highly correlated with tipo_persona and 2 other fields | High correlation |
| estado_final is highly correlated with tipo_persona | High correlation |
| tipo_venta is highly correlated with tipo_persona | High correlation |
| periodo_credito is highly correlated with tipo_persona | High correlation |
| municipio_residencia is highly correlated with tipo_persona and 2 other fields | High correlation |
| municipio_credito is highly correlated with tipo_persona | High correlation |
| empresa has 3993 (4.4%) missing values | Missing |
| cargo has 6020 (6.6%) missing values | Missing |
| tiempo_servicio has 17808 (19.6%) missing values | Missing |
| otros_ingresos_mensual has 69004 (75.8%) missing values | Missing |
| otros_ingresos_concepto has 76885 (84.4%) missing values | Missing |
| sueldo is highly skewed (γ₁ = 55.32516627) | Skewed |
| otros_ingresos_mensual is highly skewed (γ₁ = 55.02864117) | Skewed |
| valor_credito is highly skewed (γ₁ = 48.6531017) | Skewed |
| Row is uniformly distributed | Uniform |
| Row has unique values | Unique |
| empresa is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| cargo is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| tiempo_servicio is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| otros_ingresos_concepto is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| otros_ingresos_mensual has 15957 (17.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-05 21:28:21.415092 |
|---|---|
| Analysis finished | 2021-05-05 21:29:00.133553 |
| Duration | 38.72 seconds |
| Software version | pandas-profiling v2.12.0 |
| Download configuration | config.yaml |
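The reported duration is the difference between the two timestamps above. A small sketch of that check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2021-05-05 21:28:21.415092")
finished = datetime.fromisoformat("2021-05-05 21:29:00.133553")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")
```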
Row
| Distinct | 91047 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45524 |
| Minimum | 1 |
| Maximum | 91047 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4553.3 |
| Q1 | 22762.5 |
| median | 45524 |
| Q3 | 68285.5 |
| 95-th percentile | 86494.7 |
| Maximum | 91047 |
| Range | 91046 |
| Interquartile range (IQR) | 45523 |
Descriptive statistics
| Standard deviation | 26283.14932 |
|---|---|
| Coefficient of variation (CV) | 0.5773470986 |
| Kurtosis | -1.2 |
| Mean | 45524 |
| Median Absolute Deviation (MAD) | 22762 |
| Skewness | 0 |
| Sum | 4144823628 |
| Variance | 690803938 |
| Monotonicity | Strictly increasing |
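These figures are what one would expect if the variable is simply the integers 1 to 91047 (all distinct, minimum 1, maximum 91047, strictly increasing). Under that assumption, the sketch below reproduces the sum, mean, variance, standard deviation, CV and MAD from closed forms. The variance formula is the sample variance (ddof = 1), which is the one that matches the reported value.

```python
import math
import statistics

n = 91047  # number of observations

# Closed forms for the integers 1..n.
total = n * (n + 1) // 2          # sum
mean = total / n                  # equals (n + 1) / 2
variance = n * (n + 1) / 12       # sample variance (ddof=1)
std = math.sqrt(variance)
cv = std / mean                   # coefficient of variation

# Median absolute deviation around the median.
median = statistics.median(range(1, n + 1))
mad = statistics.median(abs(x - median) for x in range(1, n + 1))

print(total, mean, round(variance), std, cv, mad)
```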
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 43648 | 1 | < 0.1% |
| 27288 | 1 | < 0.1% |
| 25241 | 1 | < 0.1% |
| 31386 | 1 | < 0.1% |
| 29339 | 1 | < 0.1% |
| 19100 | 1 | < 0.1% |
| 17053 | 1 | < 0.1% |
| 23198 | 1 | < 0.1% |
| 21151 | 1 | < 0.1% |
| Other values (91037) | 91037 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 91047 | 1 | < 0.1% |
| 91046 | 1 | < 0.1% |
| 91045 | 1 | < 0.1% |
| 91044 | 1 | < 0.1% |
| 91043 | 1 | < 0.1% |
tipo_identificacion
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 19.99852823 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1820806 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
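The character totals and Unicode category counts for this variable can be recomputed from its value counts. The sketch below is an illustration using Python's `unicodedata` module, not the report's own implementation. The category codes `Lu`, `Ll` and `Zs` correspond to Uppercase Letter, Lowercase Letter and Space Separator.

```python
import unicodedata
from collections import Counter

# Value counts copied from this variable's frequency table.
value_counts = {
    "Cedula de Ciudadania": 90727,
    "Cedula de Extranjeria": 262,
    "Tarjeta de Identidad": 23,
    "Registro Civil": 17,
    "NIT": 16,
    "Pasaporte": 2,
}

# Weight each character's general category by how often its value occurs.
category_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        category_counts[unicodedata.category(ch)] += count

total_chars = sum(category_counts.values())
distinct_chars = {ch for value in value_counts for ch in value}

print(total_chars, len(distinct_chars), dict(category_counts))
```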
Sample
| 1st row | Cedula de Ciudadania |
|---|---|
| 2nd row | Cedula de Ciudadania |
| 3rd row | Cedula de Ciudadania |
| 4th row | Cedula de Ciudadania |
| 5th row | Cedula de Ciudadania |
| Value | Count | Frequency (%) |
| Cedula de Ciudadania | 90727 | 99.6% |
| Cedula de Extranjeria | 262 | 0.3% |
| Tarjeta de Identidad | 23 | < 0.1% |
| Registro Civil | 17 | < 0.1% |
| NIT | 16 | < 0.1% |
| Pasaporte | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| de | 91012 | |
| cedula | 90989 | |
| ciudadania | 90727 | |
| extranjeria | 262 | 0.1% |
| tarjeta | 23 | < 0.1% |
| identidad | 23 | < 0.1% |
| registro | 17 | < 0.1% |
| civil | 17 | < 0.1% |
| nit | 16 | < 0.1% |
| pasaporte | 2 | < 0.1% |
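The word counts above are consistent with lowercasing each value and splitting it on whitespace. That tokenization is an assumption inferred from the numbers, since the report does not show its tokenizer. A minimal sketch:

```python
from collections import Counter

# Value counts copied from this variable's frequency table.
value_counts = {
    "Cedula de Ciudadania": 90727,
    "Cedula de Extranjeria": 262,
    "Tarjeta de Identidad": 23,
    "Registro Civil": 17,
    "NIT": 16,
    "Pasaporte": 2,
}

# Each word inherits the count of the value it appears in.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())

for word, count in word_counts.most_common():
    print(f"{word}: {count} ({100 * count / total_words:.1f}%)")
```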
Most occurring characters
| Value | Count | Frequency (%) |
| a | 363767 | |
| d | 363524 | |
| e | 182328 | |
| (space) | 182041 | |
| i | 181790 | |
| C | 181733 | |
| u | 181716 | |
| n | 91012 | 5.0% |
| l | 91006 | 5.0% |
| r | 566 | < 0.1% |
| Other values (14) | 1323 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1456657 | |
| Uppercase Letter | 182108 | 10.0% |
| Space Separator | 182041 | 10.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 363767 | |
| d | 363524 | |
| e | 182328 | |
| i | 181790 | |
| u | 181716 | |
| n | 91012 | 6.2% |
| l | 91006 | 6.2% |
| r | 566 | < 0.1% |
| t | 327 | < 0.1% |
| j | 285 | < 0.1% |
| Other values (6) | 336 | < 0.1% |
| Value | Count | Frequency (%) |
| C | 181733 | |
| E | 262 | 0.1% |
| I | 39 | < 0.1% |
| T | 39 | < 0.1% |
| R | 17 | < 0.1% |
| N | 16 | < 0.1% |
| P | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| (space) | 182041 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1638765 | |
| Common | 182041 | 10.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 363767 | |
| d | 363524 | |
| e | 182328 | |
| i | 181790 | |
| C | 181733 | |
| u | 181716 | |
| n | 91012 | 5.6% |
| l | 91006 | 5.6% |
| r | 566 | < 0.1% |
| t | 327 | < 0.1% |
| Other values (13) | 996 | 0.1% |
| Value | Count | Frequency (%) |
| (space) | 182041 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1820806 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 363767 | |
| d | 363524 | |
| e | 182328 | |
| (space) | 182041 | |
| i | 181790 | |
| C | 181733 | |
| u | 181716 | |
| n | 91012 | 5.0% |
| l | 91006 | 5.0% |
| r | 566 | < 0.1% |
| Other values (14) | 1323 | 0.1% |
genero
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.458653223 |
| Min length | 8 |
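The length statistics follow from the two value counts in this variable's frequency table (49288 and 41759). A short sketch of the calculation, as an illustration rather than the report's own code:

```python
import statistics

# Value counts copied from this variable's frequency table.
value_counts = {"Femenino": 49288, "Masculino": 41759}

# One length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

total_chars = sum(lengths)
mean_length = total_chars / len(lengths)
median_length = statistics.median(lengths)

print(max(lengths), median_length, mean_length, min(lengths), total_chars)
```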
Characters and Unicode
| Total characters | 770135 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Femenino |
|---|---|
| 2nd row | Femenino |
| 3rd row | Femenino |
| 4th row | Femenino |
| 5th row | Femenino |
| Value | Count | Frequency (%) |
| Femenino | 49288 | 54.1% |
| Masculino | 41759 | 45.9% |
| Value | Count | Frequency (%) |
| femenino | 49288 | |
| masculino | 41759 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 140335 | |
| e | 98576 | |
| i | 91047 | |
| o | 91047 | |
| F | 49288 | 6.4% |
| m | 49288 | 6.4% |
| M | 41759 | 5.4% |
| a | 41759 | 5.4% |
| s | 41759 | 5.4% |
| c | 41759 | 5.4% |
| Other values (2) | 83518 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 679088 | |
| Uppercase Letter | 91047 | 11.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 140335 | |
| e | 98576 | |
| i | 91047 | |
| o | 91047 | |
| m | 49288 | 7.3% |
| a | 41759 | 6.1% |
| s | 41759 | 6.1% |
| c | 41759 | 6.1% |
| u | 41759 | 6.1% |
| l | 41759 | 6.1% |
| Value | Count | Frequency (%) |
| F | 49288 | |
| M | 41759 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 770135 |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 140335 | |
| e | 98576 | |
| i | 91047 | |
| o | 91047 | |
| F | 49288 | 6.4% |
| m | 49288 | 6.4% |
| M | 41759 | 5.4% |
| a | 41759 | 5.4% |
| s | 41759 | 5.4% |
| c | 41759 | 5.4% |
| Other values (2) | 83518 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 770135 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 140335 | |
| e | 98576 | |
| i | 91047 | |
| o | 91047 | |
| F | 49288 | 6.4% |
| m | 49288 | 6.4% |
| M | 41759 | 5.4% |
| a | 41759 | 5.4% |
| s | 41759 | 5.4% |
| c | 41759 | 5.4% |
| Other values (2) | 83518 |
estado_civil
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 8.223499951 |
| Min length | 5 |
Characters and Unicode
| Total characters | 748725 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Union Libre |
|---|---|
| 2nd row | Union Libre |
| 3rd row | Soltero |
| 4th row | Soltero |
| 5th row | Soltero |
| Value | Count | Frequency (%) |
| Union Libre | 34793 | 38.2% |
| Casado | 27732 | 30.5% |
| Soltero | 27100 | 29.8% |
| Viudo | 862 | 0.9% |
| Divorciado | 560 | 0.6% |
| Value | Count | Frequency (%) |
| libre | 34793 | |
| union | 34793 | |
| casado | 27732 | |
| soltero | 27100 | |
| viudo | 862 | 0.7% |
| divorciado | 560 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 118707 | |
| i | 71568 | |
| n | 69586 | |
| r | 62453 | 8.3% |
| e | 61893 | 8.3% |
| a | 56024 | 7.5% |
| U | 34793 | 4.6% |
| (space) | 34793 | 4.6% |
| L | 34793 | 4.6% |
| b | 34793 | 4.6% |
| Other values (11) | 169322 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 588092 | |
| Uppercase Letter | 125840 | 16.8% |
| Space Separator | 34793 | 4.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 118707 | |
| i | 71568 | |
| n | 69586 | |
| r | 62453 | |
| e | 61893 | |
| a | 56024 | |
| b | 34793 | 5.9% |
| d | 29154 | 5.0% |
| s | 27732 | 4.7% |
| l | 27100 | 4.6% |
| Other values (4) | 29082 | 4.9% |
| Value | Count | Frequency (%) |
| U | 34793 | |
| L | 34793 | |
| C | 27732 | |
| S | 27100 | |
| V | 862 | 0.7% |
| D | 560 | 0.4% |
| Value | Count | Frequency (%) |
| (space) | 34793 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 713932 | |
| Common | 34793 | 4.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 118707 | |
| i | 71568 | |
| n | 69586 | |
| r | 62453 | |
| e | 61893 | |
| a | 56024 | 7.8% |
| U | 34793 | 4.9% |
| L | 34793 | 4.9% |
| b | 34793 | 4.9% |
| d | 29154 | 4.1% |
| Other values (10) | 140168 |
| Value | Count | Frequency (%) |
| (space) | 34793 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 748725 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 118707 | |
| i | 71568 | |
| n | 69586 | |
| r | 62453 | 8.3% |
| e | 61893 | 8.3% |
| a | 56024 | 7.5% |
| U | 34793 | 4.6% |
| (space) | 34793 | 4.6% |
| L | 34793 | 4.6% |
| b | 34793 | 4.6% |
| Other values (11) | 169322 |
edad
Real number (ℝ≥0)
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.11591815 |
| Minimum | 18 |
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 26 |
| Q1 | 35 |
| median | 43 |
| Q3 | 52 |
| 95-th percentile | 64 |
| Maximum | 98 |
| Range | 80 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.92135914 |
|---|---|
| Coefficient of variation (CV) | 0.2702280636 |
| Kurtosis | -0.3784568293 |
| Mean | 44.11591815 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.3400818195 |
| Sum | 4016622 |
| Variance | 142.1188037 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 3405 | 3.7% |
| 40 | 3167 | 3.5% |
| 41 | 2935 | 3.2% |
| 45 | 2855 | 3.1% |
| 35 | 2811 | 3.1% |
| 49 | 2796 | 3.1% |
| 46 | 2687 | 3.0% |
| 36 | 2656 | 2.9% |
| 31 | 2603 | 2.9% |
| 39 | 2567 | 2.8% |
| Other values (63) | 62565 | 68.7% |
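The "Other values" row is the remainder once the ten most common values are subtracted from the totals. A minimal sketch of that bookkeeping, using the counts listed above:

```python
# Totals for edad, copied from its overview table.
n_observations = 91047
n_distinct = 73

# Ten most common ages and their counts, copied from the table above.
top_counts = {
    44: 3405, 40: 3167, 41: 2935, 45: 2855, 35: 2811,
    49: 2796, 46: 2687, 36: 2656, 31: 2603, 39: 2567,
}

# Whatever is not in the top ten ends up in "Other values".
other_count = n_observations - sum(top_counts.values())
other_distinct = n_distinct - len(top_counts)
other_pct = 100 * other_count / n_observations

print(f"Other values ({other_distinct}): {other_count} ({other_pct:.1f}%)")
```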
| Value | Count | Frequency (%) |
| 18 | 12 | < 0.1% |
| 19 | 66 | 0.1% |
| 20 | 215 | 0.2% |
| 21 | 291 | 0.3% |
| 22 | 461 | 0.5% |
| Value | Count | Frequency (%) |
| 98 | 2 | < 0.1% |
| 90 | 3 | < 0.1% |
| 89 | 1 | < 0.1% |
| 88 | 2 | < 0.1% |
| 87 | 1 | < 0.1% |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.560325985 |
| Min length | 4 |
Characters and Unicode
| Total characters | 597298 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ARAUQUITA |
|---|---|
| 2nd row | ARAUQUITA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUQUITA |
| Value | Count | Frequency (%) |
| ARAUCA | 39508 | 43.4% |
| TAME | 24614 | 27.0% |
| ARAUQUITA | 9258 | 10.2% |
| PUERTO RONDON | 4662 | 5.1% |
| SARAVENA | 4288 | 4.7% |
| PUERTO JORDAN | 4161 | 4.6% |
| FORTUL | 3653 | 4.0% |
| PANAMA | 335 | 0.4% |
| HATO COROZAL | 213 | 0.2% |
| CRAVO NORTE | 148 | 0.2% |
| Other values (17) | 207 | 0.2% |
| Value | Count | Frequency (%) |
| arauca | 39508 | |
| tame | 24614 | |
| arauquita | 9258 | 9.2% |
| puerto | 8823 | 8.8% |
| rondon | 4662 | 4.7% |
| saravena | 4288 | 4.3% |
| jordan | 4161 | 4.2% |
| fortul | 3653 | 3.6% |
| panama | 335 | 0.3% |
| hato | 213 | 0.2% |
| Other values (24) | 726 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (16) | 57560 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 588103 | |
| Space Separator | 9194 | 1.5% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 |
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 588103 | |
| Common | 9195 | 1.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 | |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 597297 | |
| None | 1 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (15) | 57559 | 9.6% |
| Value | Count | Frequency (%) |
| Á | 1 |
tipo_persona
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 637329 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Natural |
|---|---|
| 2nd row | Natural |
| 3rd row | Natural |
| 4th row | Natural |
| 5th row | Natural |
| Value | Count | Frequency (%) |
| Natural | 91047 | 100.0% |
| Value | Count | Frequency (%) |
| natural | 91047 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 182094 | |
| N | 91047 | |
| t | 91047 | |
| u | 91047 | |
| r | 91047 | |
| l | 91047 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 546282 | |
| Uppercase Letter | 91047 | 14.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 182094 | |
| t | 91047 | |
| u | 91047 | |
| r | 91047 | |
| l | 91047 |
| Value | Count | Frequency (%) |
| N | 91047 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 637329 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 182094 | |
| N | 91047 | |
| t | 91047 | |
| u | 91047 | |
| r | 91047 | |
| l | 91047 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 637329 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 182094 | |
| N | 91047 | |
| t | 91047 | |
| u | 91047 | |
| r | 91047 | |
| l | 91047 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.560325985 |
| Min length | 4 |
Characters and Unicode
| Total characters | 597298 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ARAUQUITA |
|---|---|
| 2nd row | ARAUQUITA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUQUITA |
| Value | Count | Frequency (%) |
| ARAUCA | 39508 | 43.4% |
| TAME | 24614 | 27.0% |
| ARAUQUITA | 9258 | 10.2% |
| PUERTO RONDON | 4662 | 5.1% |
| SARAVENA | 4288 | 4.7% |
| PUERTO JORDAN | 4161 | 4.6% |
| FORTUL | 3653 | 4.0% |
| PANAMA | 335 | 0.4% |
| HATO COROZAL | 213 | 0.2% |
| CRAVO NORTE | 148 | 0.2% |
| Other values (17) | 207 | 0.2% |
| Value | Count | Frequency (%) |
| arauca | 39508 | |
| tame | 24614 | |
| arauquita | 9258 | 9.2% |
| puerto | 8823 | 8.8% |
| rondon | 4662 | 4.7% |
| saravena | 4288 | 4.3% |
| jordan | 4161 | 4.2% |
| fortul | 3653 | 3.6% |
| panama | 335 | 0.3% |
| hato | 213 | 0.2% |
| Other values (24) | 726 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (16) | 57560 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 588103 | |
| Space Separator | 9194 | 1.5% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 |
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 588103 | |
| Common | 9195 | 1.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 | |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 597297 | |
| None | 1 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (15) | 57559 | 9.6% |
| Value | Count | Frequency (%) |
| Á | 1 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.560325985 |
| Min length | 4 |
Characters and Unicode
| Total characters | 597298 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ARAUQUITA |
|---|---|
| 2nd row | ARAUQUITA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUQUITA |
| Value | Count | Frequency (%) |
| ARAUCA | 39508 | 43.4% |
| TAME | 24614 | 27.0% |
| ARAUQUITA | 9258 | 10.2% |
| PUERTO RONDON | 4662 | 5.1% |
| SARAVENA | 4288 | 4.7% |
| PUERTO JORDAN | 4161 | 4.6% |
| FORTUL | 3653 | 4.0% |
| PANAMA | 335 | 0.4% |
| HATO COROZAL | 213 | 0.2% |
| CRAVO NORTE | 148 | 0.2% |
| Other values (17) | 207 | 0.2% |
| Value | Count | Frequency (%) |
| arauca | 39508 | |
| tame | 24614 | |
| arauquita | 9258 | 9.2% |
| puerto | 8823 | 8.8% |
| rondon | 4662 | 4.7% |
| saravena | 4288 | 4.3% |
| jordan | 4161 | 4.2% |
| fortul | 3653 | 3.6% |
| panama | 335 | 0.3% |
| hato | 213 | 0.2% |
| Other values (24) | 726 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (16) | 57560 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 588103 | |
| Space Separator | 9194 | 1.5% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 |
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 588103 | |
| Common | 9195 | 1.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.7% |
| U | 70628 | 12.0% |
| T | 46747 | 7.9% |
| C | 40042 | 6.8% |
| E | 37927 | 6.4% |
| O | 26928 | 4.6% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (14) | 48365 | 8.2% |
| Value | Count | Frequency (%) |
| (space) | 9194 | |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 597297 | |
| None | 1 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 189898 | |
| R | 74982 | 12.6% |
| U | 70628 | 11.8% |
| T | 46747 | 7.8% |
| C | 40042 | 6.7% |
| E | 37927 | 6.3% |
| O | 26928 | 4.5% |
| M | 24994 | 4.2% |
| N | 18280 | 3.1% |
| I | 9312 | 1.6% |
| Other values (15) | 57559 | 9.6% |
| Value | Count | Frequency (%) |
| Á | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 182094 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Si |
|---|---|
| 2nd row | Si |
| 3rd row | No |
| 4th row | No |
| 5th row | Si |
| Value | Count | Frequency (%) |
| Si | 64704 | 71.1% |
| No | 26343 | 28.9% |
| Value | Count | Frequency (%) |
| si | 64704 | |
| no | 26343 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 64704 | |
| i | 64704 | |
| N | 26343 | |
| o | 26343 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 91047 | |
| Lowercase Letter | 91047 |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 64704 | |
| N | 26343 |
| Value | Count | Frequency (%) |
| i | 64704 | |
| o | 26343 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 182094 |
Most frequent character per script
| Value | Count | Frequency (%) |
| S | 64704 | |
| i | 64704 | |
| N | 26343 | |
| o | 26343 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182094 |
Most frequent character per block
| Value | Count | Frequency (%) |
| S | 64704 | |
| i | 64704 | |
| N | 26343 | |
| o | 26343 |
sueldo
| Distinct | 683 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2859063.011 |
| Minimum | 0 |
| Maximum | 3204740882 |
| Zeros | 356 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 700000 |
| Q1 | 908526 |
| median | 1500000 |
| Q3 | 2500000 |
| 95-th percentile | 4500000 |
| Maximum | 3204740882 |
| Range | 3204740882 |
| Interquartile range (IQR) | 1591474 |
Descriptive statistics
| Standard deviation | 46993410.9 |
|---|---|
| Coefficient of variation (CV) | 16.4366475 |
| Kurtosis | 3260.414092 |
| Mean | 2859063.011 |
| Median Absolute Deviation (MAD) | 591474 |
| Skewness | 55.32516627 |
| Sum | 2.6030911 × 10¹¹ |
| Variance | 2.208380668 × 10¹⁵ |
| Monotonicity | Not monotonic |
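Several of the descriptive statistics above are derived from one another. The sketch below recomputes the IQR, range, CV, variance and sum from the reported quantiles, mean and standard deviation. Inputs are the rounded figures from the tables, so the results agree only up to rounding.

```python
# Figures copied from the tables above.
n_observations = 91047
mean = 2859063.011
std = 46993410.9
q1, q3 = 908526, 2500000
minimum, maximum = 0, 3204740882

iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
cv = std / mean                # coefficient of variation
variance = std ** 2
total = mean * n_observations  # sum of all values

print(iqr, value_range, cv, variance, total)
```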
| Value | Count | Frequency (%) |
| 908526 | 11026 | 12.1% |
| 2000000 | 9249 | 10.2% |
| 1000000 | 7694 | 8.5% |
| 1500000 | 6764 | 7.4% |
| 1200000 | 5502 | 6.0% |
| 3000000 | 4903 | 5.4% |
| 800000 | 3043 | 3.3% |
| 4000000 | 2992 | 3.3% |
| 1800000 | 2854 | 3.1% |
| 2500000 | 2743 | 3.0% |
| Other values (673) | 34277 | 37.6% |
| Value | Count | Frequency (%) |
| 0 | 356 | 0.4% |
| 152 | 131 | 0.1% |
| 800 | 2 | < 0.1% |
| 30000 | 1 | < 0.1% |
| 93000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3204740882 | 3 | < 0.1% |
| 3203428840 | 2 | < 0.1% |
| 3174339461 | 3 | < 0.1% |
| 3016878081 | 4 | < 0.1% |
| 2000000000 | 16 | < 0.1% |
otros_ingresos_mensual
| Distinct | 96 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 69004 |
| Missing (%) | 75.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1421585.435 |
| Minimum | 0 |
| Maximum | 3143089323 |
| Zeros | 15957 |
| Zeros (%) | 17.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 300000 |
| 95-th percentile | 2000000 |
| Maximum | 3143089323 |
| Range | 3143089323 |
| Interquartile range (IQR) | 300000 |
Descriptive statistics
| Standard deviation | 56026266.07 |
|---|---|
| Coefficient of variation (CV) | 39.4111143 |
| Kurtosis | 3050.806515 |
| Mean | 1421585.435 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 55.02864117 |
| Sum | 3.133600775 × 10¹⁰ |
| Variance | 3.13894249 × 10¹⁵ |
| Monotonicity | Not monotonic |
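The percentages for this variable do not all share a denominator. Judging by the reported figures, the missing and zero percentages are relative to all 91047 rows, while the distinct percentage is relative to the non-missing values only. That reading is inferred from the numbers, not taken from the pandas-profiling source. A sketch of the arithmetic:

```python
# Figures copied from the overview table above.
n_observations = 91047
missing = 69004
zeros = 15957
distinct = 96

# Rows that actually carry a value.
present = n_observations - missing

missing_pct = 100 * missing / n_observations   # relative to all rows
zeros_pct = 100 * zeros / n_observations       # relative to all rows
distinct_pct = 100 * distinct / present        # relative to non-missing rows

# Zeros dominate the non-missing values, consistent with Q1 and the median being 0.
zeros_among_present_pct = 100 * zeros / present

print(present, f"{missing_pct:.1f}", f"{zeros_pct:.1f}", f"{distinct_pct:.1f}")
print(f"{zeros_among_present_pct:.1f}")
```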
| Value | Count | Frequency (%) |
| 0 | 15957 | 17.5% |
| 1000000 | 894 | 1.0% |
| 500000 | 706 | 0.8% |
| 400000 | 506 | 0.6% |
| 600000 | 423 | 0.5% |
| 2000000 | 388 | 0.4% |
| 800000 | 337 | 0.4% |
| 200000 | 319 | 0.4% |
| 300000 | 279 | 0.3% |
| 1500000 | 234 | 0.3% |
| Other values (86) | 2000 | 2.2% |
| (Missing) | 69004 | 75.8% |
| Value | Count | Frequency (%) |
| 0 | 15957 | 17.5% |
| 50000 | 3 | < 0.1% |
| 80000 | 2 | < 0.1% |
| 100000 | 79 | 0.1% |
| 120000 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 3143089323 | 1 | < 0.1% |
| 3118534287 | 2 | < 0.1% |
| 3114770734 | 4 | < 0.1% |
| 1000000000 | 1 | < 0.1% |
| 60000000 | 18 | < 0.1% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.431820928 |
| Min length | 4 |
Characters and Unicode
| Total characters | 585598 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ARAUQUITA |
|---|---|
| 2nd row | ARAUQUITA |
| 3rd row | ARAUQUITA |
| 4th row | ARAUQUITA |
| 5th row | ARAUQUITA |
| Value | Count | Frequency (%) |
| ARAUCA | 40814 | 44.8% |
| TAME | 25119 | 27.6% |
| ARAUQUITA | 8799 | 9.7% |
| SARAVENA | 5845 | 6.4% |
| PUERTO RONDON | 3691 | 4.1% |
| PUERTO JORDAN | 3660 | 4.0% |
| FORTUL | 3066 | 3.4% |
| PANAMA | 51 | 0.1% |
| CRAVO NORTE | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| arauca | 40814 | |
| tame | 25119 | |
| arauquita | 8799 | 8.9% |
| puerto | 7351 | 7.5% |
| saravena | 5845 | 5.9% |
| rondon | 3691 | 3.8% |
| jordan | 3660 | 3.7% |
| fortul | 3066 | 3.1% |
| panama | 51 | 0.1% |
| norte | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 195308 | |
| R | 73230 | 12.5% |
| U | 68829 | 11.8% |
| T | 44337 | 7.6% |
| C | 40816 | 7.0% |
| E | 38317 | 6.5% |
| M | 25170 | 4.3% |
| O | 21463 | 3.7% |
| N | 16940 | 2.9% |
| Q | 8799 | 1.5% |
| Other values (9) | 52389 | 8.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 578245 | |
| Space Separator | 7353 | 1.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 195308 | |
| R | 73230 | 12.7% |
| U | 68829 | 11.9% |
| T | 44337 | 7.7% |
| C | 40816 | 7.1% |
| E | 38317 | 6.6% |
| M | 25170 | 4.4% |
| O | 21463 | 3.7% |
| N | 16940 | 2.9% |
| Q | 8799 | 1.5% |
| Other values (8) | 45036 | 7.8% |
| Value | Count | Frequency (%) |
| (space) | 7353 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 578245 | |
| Common | 7353 | 1.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 195308 | |
| R | 73230 | 12.7% |
| U | 68829 | 11.9% |
| T | 44337 | 7.7% |
| C | 40816 | 7.1% |
| E | 38317 | 6.6% |
| M | 25170 | 4.4% |
| O | 21463 | 3.7% |
| N | 16940 | 2.9% |
| Q | 8799 | 1.5% |
| Other values (8) | 45036 | 7.8% |
| Value | Count | Frequency (%) |
| (space) | 7353 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 585598 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 195308 | |
| R | 73230 | 12.5% |
| U | 68829 | 11.8% |
| T | 44337 | 7.6% |
| C | 40816 | 7.0% |
| E | 38317 | 6.5% |
| M | 25170 | 4.3% |
| O | 21463 | 3.7% |
| N | 16940 | 2.9% |
| Q | 8799 | 1.5% |
| Other values (9) | 52389 | 8.9% |
codeudor
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
| SIN CODEUDOR | 80123 |
|---|---|
| CON CODEUDOR | 10924 |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 1092564 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
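The character totals above can be cross-checked from the two labels of this variable and their counts. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `char_profile` is hypothetical, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter

# Label counts as reported in this variable's frequency table.
label_counts = {"SIN CODEUDOR": 80123, "CON CODEUDOR": 10924}


def char_profile(counts):
    """Return (total characters, per-character counts, per-category counts)."""
    chars = Counter()
    categories = Counter()
    for label, n in counts.items():
        for ch in label:
            chars[ch] += n
            # unicodedata.category returns codes such as "Lu" (uppercase
            # letter) or "Zs" (space separator).
            categories[unicodedata.category(ch)] += n
    return sum(chars.values()), chars, categories


total, chars, categories = char_profile(label_counts)
print("total characters:", total)
print("distinct characters:", len(chars))
print("categories:", dict(categories))
```

Under these assumptions the sketch yields the same total, distinct-character count and category split as the tables in this section.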
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SIN CODEUDOR |
|---|---|
| 2nd row | SIN CODEUDOR |
| 3rd row | CON CODEUDOR |
| 4th row | SIN CODEUDOR |
| 5th row | SIN CODEUDOR |
| Value | Count | Frequency (%) |
| SIN CODEUDOR | 80123 | 88.0% |
| CON CODEUDOR | 10924 | 12.0% |
| Value | Count | Frequency (%) |
| codeudor | 91047 | 50.0% |
| sin | 80123 | 44.0% |
| con | 10924 | 6.0% |
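The word table above is consistent with lowercasing each label, splitting it on whitespace, and weighting every word by the label's count. A minimal sketch of that idea follows; the tokenization rule is an assumption, not the profiler's documented behaviour.

```python
from collections import Counter

# Label counts as reported for this variable.
label_counts = {"SIN CODEUDOR": 80123, "CON CODEUDOR": 10924}

words = Counter()
for label, n in label_counts.items():
    # Assumed tokenization: lowercase, then split on whitespace.
    for word in label.lower().split():
        words[word] += n

total_words = sum(words.values())
for word, count in words.most_common():
    share = 100 * count / total_words
    print(f"{word:10s} {count:6d} {share:5.1f}%")
```

Note that the percentages are shares of all words (two per row here), not shares of rows, which is why `con` appears at 6.0% although 12.0% of rows are `CON CODEUDOR`.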
Most occurring characters
| Value | Count | Frequency (%) |
| O | 193018 | |
| D | 182094 | |
| C | 101971 | |
| N | 91047 | |
| 91047 | ||
| E | 91047 | |
| U | 91047 | |
| R | 91047 | |
| S | 80123 | |
| I | 80123 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1001517 | |
| Space Separator | 91047 | 8.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 193018 | |
| D | 182094 | |
| C | 101971 | |
| N | 91047 | |
| E | 91047 | |
| U | 91047 | |
| R | 91047 | |
| S | 80123 | |
| I | 80123 |
| Value | Count | Frequency (%) |
| 91047 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1001517 | |
| Common | 91047 | 8.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| O | 193018 | |
| D | 182094 | |
| C | 101971 | |
| N | 91047 | |
| E | 91047 | |
| U | 91047 | |
| R | 91047 | |
| S | 80123 | |
| I | 80123 |
| Value | Count | Frequency (%) |
| 91047 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1092564 |
Most frequent character per block
| Value | Count | Frequency (%) |
| O | 193018 | |
| D | 182094 | |
| C | 101971 | |
| N | 91047 | |
| 91047 | ||
| E | 91047 | |
| U | 91047 | |
| R | 91047 | |
| S | 80123 | |
| I | 80123 |
entidad
Categorical
| Distinct | 151 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 711.4 KiB |
| INDEPENDIENTES ARAUCA | 20352 |
|---|---|
| CONTADO | 11305 |
| CREDITOS TAME | 11002 |
| CONTADOS DE TAME | 8118 |
| INDEP. ARAUQUITA | 5015 |
| Other values (146) | 35254 |
Length
| Max length | 35 |
|---|---|
| Median length | 16 |
| Mean length | 15.8191903 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1440274 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | INDEP. ARAUQUITA |
|---|---|
| 2nd row | INDEP. ARAUQUITA |
| 3rd row | INDEP. ARAUQUITA |
| 4th row | INDEP. ARAUQUITA |
| 5th row | INDEP. ARAUQUITA |
| Value | Count | Frequency (%) |
| INDEPENDIENTES ARAUCA | 20352 | 22.4% |
| CONTADO | 11305 | 12.4% |
| CREDITOS TAME | 11002 | 12.1% |
| CONTADOS DE TAME | 8118 | 8.9% |
| INDEP. ARAUQUITA | 5015 | 5.5% |
| PUEBLO NUEVO | 3068 | 3.4% |
| FORTUL | 2950 | 3.2% |
| CONTADOS ARAUQUITA | 2919 | 3.2% |
| PUERTO RONDON | 2896 | 3.2% |
| FONDO EDUCATIVO REGIONAL | 2584 | 2.8% |
| Other values (141) | 20837 | 22.9% |
| Value | Count | Frequency (%) |
| arauca | 24020 | |
| tame | 22833 | |
| independientes | 22644 | |
| de | 15003 | 8.0% |
| contados | 13474 | 7.2% |
| creditos | 11753 | 6.3% |
| contado | 11305 | 6.0% |
| arauquita | 8721 | 4.6% |
| indep | 5015 | 2.7% |
| saravena | 4858 | 2.6% |
| Other values (179) | 48271 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 188981 | |
| E | 178020 | |
| N | 126055 | |
| D | 118282 | |
| O | 114616 | |
| T | 111900 | |
| 96888 | 6.7% | |
| I | 92010 | 6.4% |
| R | 73792 | 5.1% |
| C | 70970 | 4.9% |
| Other values (34) | 268760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1330368 | |
| Space Separator | 96888 | 6.7% |
| Other Punctuation | 10181 | 0.7% |
| Dash Punctuation | 1022 | 0.1% |
| Open Punctuation | 881 | 0.1% |
| Close Punctuation | 859 | 0.1% |
| Lowercase Letter | 57 | < 0.1% |
| Decimal Number | 18 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 188981 | |
| E | 178020 | |
| N | 126055 | |
| D | 118282 | |
| O | 114616 | |
| T | 111900 | |
| I | 92010 | |
| R | 73792 | 5.5% |
| C | 70970 | 5.3% |
| S | 69155 | 5.2% |
| Other values (18) | 186587 |
| Value | Count | Frequency (%) |
| o | 22 | |
| a | 15 | |
| r | 5 | 8.8% |
| v | 5 | 8.8% |
| e | 5 | 8.8% |
| n | 5 | 8.8% |
| Value | Count | Frequency (%) |
| . | 8285 | |
| , | 1185 | 11.6% |
| " | 702 | 6.9% |
| % | 9 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 9 | |
| 0 | 9 |
| Value | Count | Frequency (%) |
| 96888 |
| Value | Count | Frequency (%) |
| - | 1022 |
| Value | Count | Frequency (%) |
| ( | 881 |
| Value | Count | Frequency (%) |
| ) | 859 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1330425 | |
| Common | 109849 | 7.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 188981 | |
| E | 178020 | |
| N | 126055 | |
| D | 118282 | |
| O | 114616 | |
| T | 111900 | |
| I | 92010 | |
| R | 73792 | 5.5% |
| C | 70970 | 5.3% |
| S | 69155 | 5.2% |
| Other values (24) | 186644 |
| Value | Count | Frequency (%) |
| 96888 | ||
| . | 8285 | 7.5% |
| , | 1185 | 1.1% |
| - | 1022 | 0.9% |
| ( | 881 | 0.8% |
| ) | 859 | 0.8% |
| " | 702 | 0.6% |
| 6 | 9 | < 0.1% |
| 0 | 9 | < 0.1% |
| % | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1440033 | |
| None | 241 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 188981 | |
| E | 178020 | |
| N | 126055 | |
| D | 118282 | |
| O | 114616 | |
| T | 111900 | |
| 96888 | 6.7% | |
| I | 92010 | 6.4% |
| R | 73792 | 5.1% |
| C | 70970 | 4.9% |
| Other values (32) | 268519 |
| Value | Count | Frequency (%) |
| Ñ | 205 | |
| Á | 36 | 14.9% |
año_credito
Real number (ℝ≥0)
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2014.573846 |
| Minimum | 1993 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 1993 |
|---|---|
| 5-th percentile | 2004 |
| Q1 | 2012 |
| median | 2016 |
| Q3 | 2019 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 28 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.234178671 |
|---|---|
| Coefficient of variation (CV) | 0.002598156766 |
| Kurtosis | 0.5840257855 |
| Mean | 2014.573846 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -1.068515684 |
| Sum | 183420905 |
| Variance | 27.39662636 |
| Monotonicity | Not monotonic |
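Several of the figures above are derived from one another, so they can be sanity-checked with simple arithmetic. The sketch below only reuses numbers reported in this section for `año_credito`; the variable names are illustrative.

```python
# Values copied from the año_credito tables above.
mean = 2014.573846
std = 5.234178671
minimum, maximum = 1993, 2021
q1, q3 = 2012, 2019
n = 91047  # number of observations in the dataset

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
total = mean * n                 # approximate sum (the mean is rounded)

print(f"CV       = {cv:.12f}")
print(f"Variance = {variance:.8f}")
print(f"Range    = {value_range}, IQR = {iqr}")
print(f"Sum      = {total:.0f}")
```

The derived values agree with the reported CV, variance, range, IQR and sum up to rounding of the inputs.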
| Value | Count | Frequency (%) |
| 2020 | 12916 | 14.2% |
| 2019 | 12395 | 13.6% |
| 2018 | 7488 | 8.2% |
| 2015 | 7058 | 7.8% |
| 2017 | 6514 | 7.2% |
| 2016 | 5967 | 6.6% |
| 2014 | 5513 | 6.1% |
| 2013 | 4956 | 5.4% |
| 2012 | 4464 | 4.9% |
| 2011 | 3847 | 4.2% |
| Other values (16) | 19929 | 21.9% |
| Value | Count | Frequency (%) |
| 1993 | 1 | < 0.1% |
| 1997 | 206 | 0.2% |
| 1998 | 404 | 0.4% |
| 1999 | 514 | 0.6% |
| 2000 | 597 | 0.7% |
| Value | Count | Frequency (%) |
| 2021 | 2020 | 2.2% |
| 2020 | 12916 | 14.2% |
| 2019 | 12395 | 13.6% |
| 2018 | 7488 | 8.2% |
| 2017 | 6514 | 7.2% |
mes_credito
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.831076257 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.48121566 |
|---|---|
| Coefficient of variation (CV) | 0.5096145218 |
| Kurtosis | -1.218887257 |
| Mean | 6.831076257 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.108812585 |
| Sum | 621949 |
| Variance | 12.11886247 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 9588 | 10.5% |
| 10 | 8806 | 9.7% |
| 11 | 7988 | 8.8% |
| 9 | 7859 | 8.6% |
| 7 | 7722 | 8.5% |
| 6 | 7512 | 8.3% |
| 5 | 7443 | 8.2% |
| 8 | 7060 | 7.8% |
| 3 | 7048 | 7.7% |
| 2 | 6931 | 7.6% |
| Other values (2) | 13090 | 14.4% |
| Value | Count | Frequency (%) |
| 1 | 6651 | 7.3% |
| 2 | 6931 | 7.6% |
| 3 | 7048 | 7.7% |
| 4 | 6439 | 7.1% |
| 5 | 7443 | 8.2% |
| Value | Count | Frequency (%) |
| 12 | 9588 | 10.5% |
| 11 | 7988 | 8.8% |
| 10 | 8806 | 9.7% |
| 9 | 7859 | 8.6% |
| 8 | 7060 | 7.8% |
valor_credito
Real number (ℝ≥0)
| Distinct | 6710 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1292348.912 |
| Minimum | 1 |
|---|---|
| Maximum | 299250000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 60000 |
| Q1 | 350000 |
| median | 881000 |
| Q3 | 1570000 |
| 95-th percentile | 4214700 |
| Maximum | 299250000 |
| Range | 299249999 |
| Interquartile range (IQR) | 1220000 |
Descriptive statistics
| Standard deviation | 1982037.223 |
|---|---|
| Coefficient of variation (CV) | 1.533670362 |
| Kurtosis | 6434.10913 |
| Mean | 1292348.912 |
| Median Absolute Deviation (MAD) | 591000 |
| Skewness | 48.6531017 |
| Sum | 1.176644914 × 10¹¹ |
| Variance | 3.928471555 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 669 | 0.7% |
| 160000 | 641 | 0.7% |
| 200000 | 633 | 0.7% |
| 1200000 | 628 | 0.7% |
| 180000 | 623 | 0.7% |
| 1000000 | 614 | 0.7% |
| 130000 | 598 | 0.7% |
| 50000 | 595 | 0.7% |
| 400000 | 551 | 0.6% |
| 600000 | 546 | 0.6% |
| Other values (6700) | 84949 | 93.3% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 1000 | 6 | < 0.1% |
| 1250 | 1 | < 0.1% |
| 1300 | 1 | < 0.1% |
| 1500 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 299250000 | 1 | < 0.1% |
| 183719000 | 1 | < 0.1% |
| 75160000 | 1 | < 0.1% |
| 41862000 | 1 | < 0.1% |
| 40784000 | 1 | < 0.1% |
estado_final
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
| PAGADO VENCIDO | 38859 |
|---|---|
| PAGADO ANTICIPADO | 20290 |
| CONTADO | 19522 |
| PAGADO A TIEMPO | 5989 |
| DESCUENTO EN VENTA | 5342 |
| Other values (3) | 1045 |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 13.45540215 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1225074 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAGADO VENCIDO |
|---|---|
| 2nd row | PAGADO VENCIDO |
| 3rd row | PAGADO VENCIDO |
| 4th row | PAGADO ANTICIPADO |
| 5th row | PAGADO VENCIDO |
| Value | Count | Frequency (%) |
| PAGADO VENCIDO | 38859 | 42.7% |
| PAGADO ANTICIPADO | 20290 | 22.3% |
| CONTADO | 19522 | 21.4% |
| PAGADO A TIEMPO | 5989 | 6.6% |
| DESCUENTO EN VENTA | 5342 | 5.9% |
| DEVOLUCION | 528 | 0.6% |
| CARTERA CASTIGADA | 368 | 0.4% |
| OTROS CIERRES | 149 | 0.2% |
| Value | Count | Frequency (%) |
| pagado | 65138 | 37.6% |
| vencido | 38859 | 22.4% |
| anticipado | 20290 | 11.7% |
| contado | 19522 | 11.3% |
| tiempo | 5989 | 3.5% |
| a | 5989 | 3.5% |
| venta | 5342 | 3.1% |
| descuento | 5342 | 3.1% |
| en | 5342 | 3.1% |
| devolucion | 528 | 0.3% |
| Other values (4) | 1034 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 203549 | |
| O | 176016 | |
| D | 150047 | |
| N | 95225 | |
| P | 91417 | |
| I | 86473 | |
| C | 85426 | |
| 82328 | ||
| E | 67410 | 5.5% |
| G | 65506 | 5.3% |
| Other values (7) | 121677 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1142746 | |
| Space Separator | 82328 | 6.7% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 203549 | |
| O | 176016 | |
| D | 150047 | |
| N | 95225 | |
| P | 91417 | |
| I | 86473 | |
| C | 85426 | |
| E | 67410 | 5.9% |
| G | 65506 | 5.7% |
| T | 57370 | 5.0% |
| Other values (6) | 64307 | 5.6% |
| Value | Count | Frequency (%) |
| 82328 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1142746 | |
| Common | 82328 | 6.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 203549 | |
| O | 176016 | |
| D | 150047 | |
| N | 95225 | |
| P | 91417 | |
| I | 86473 | |
| C | 85426 | |
| E | 67410 | 5.9% |
| G | 65506 | 5.7% |
| T | 57370 | 5.0% |
| Other values (6) | 64307 | 5.6% |
| Value | Count | Frequency (%) |
| 82328 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1225074 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 203549 | |
| O | 176016 | |
| D | 150047 | |
| N | 95225 | |
| P | 91417 | |
| I | 86473 | |
| C | 85426 | |
| 82328 | ||
| E | 67410 | 5.5% |
| G | 65506 | 5.3% |
| Other values (7) | 121677 |
tipo_venta
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
| ELECTRODOMESTICOS | 90236 |
|---|---|
| MOTOS | 811 |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.89311015 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1538067 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ELECTRODOMESTICOS |
|---|---|
| 2nd row | ELECTRODOMESTICOS |
| 3rd row | ELECTRODOMESTICOS |
| 4th row | ELECTRODOMESTICOS |
| 5th row | ELECTRODOMESTICOS |
| Value | Count | Frequency (%) |
| ELECTRODOMESTICOS | 90236 | 99.1% |
| MOTOS | 811 | 0.9% |
| Value | Count | Frequency (%) |
| electrodomesticos | 90236 | 99.1% |
| motos | 811 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 272330 | |
| E | 270708 | |
| T | 181283 | |
| S | 181283 | |
| C | 180472 | |
| M | 91047 | 5.9% |
| L | 90236 | 5.9% |
| R | 90236 | 5.9% |
| D | 90236 | 5.9% |
| I | 90236 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1538067 |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 272330 | |
| E | 270708 | |
| T | 181283 | |
| S | 181283 | |
| C | 180472 | |
| M | 91047 | 5.9% |
| L | 90236 | 5.9% |
| R | 90236 | 5.9% |
| D | 90236 | 5.9% |
| I | 90236 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1538067 |
Most frequent character per script
| Value | Count | Frequency (%) |
| O | 272330 | |
| E | 270708 | |
| T | 181283 | |
| S | 181283 | |
| C | 180472 | |
| M | 91047 | 5.9% |
| L | 90236 | 5.9% |
| R | 90236 | 5.9% |
| D | 90236 | 5.9% |
| I | 90236 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1538067 |
Most frequent character per block
| Value | Count | Frequency (%) |
| O | 272330 | |
| E | 270708 | |
| T | 181283 | |
| S | 181283 | |
| C | 180472 | |
| M | 91047 | 5.9% |
| L | 90236 | 5.9% |
| R | 90236 | 5.9% |
| D | 90236 | 5.9% |
| I | 90236 | 5.9% |
periodo_credito
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
| MENSUAL(ES) | 73386 |
|---|---|
| DIARIA(S) | 16634 |
| SEMANAL(ES) | 775 |
| QUINCENAL(ES) | 252 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10.6401419 |
| Min length | 9 |
Characters and Unicode
| Total characters | 968753 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MENSUAL(ES) |
|---|---|
| 2nd row | MENSUAL(ES) |
| 3rd row | MENSUAL(ES) |
| 4th row | MENSUAL(ES) |
| 5th row | MENSUAL(ES) |
| Value | Count | Frequency (%) |
| MENSUAL(ES) | 73386 | 80.6% |
| DIARIA(S) | 16634 | 18.3% |
| SEMANAL(ES) | 775 | 0.9% |
| QUINCENAL(ES) | 252 | 0.3% |
| Value | Count | Frequency (%) |
| mensual(es | 73386 | 80.6% |
| diaria(s | 16634 | 18.3% |
| semanal(es | 775 | 0.9% |
| quincenal(es | 252 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 165208 | |
| E | 148826 | |
| A | 108456 | |
| ( | 91047 | |
| ) | 91047 | |
| N | 74665 | |
| L | 74413 | |
| M | 74161 | |
| U | 73638 | |
| I | 33520 | 3.5% |
| Other values (4) | 33772 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 786659 | |
| Open Punctuation | 91047 | 9.4% |
| Close Punctuation | 91047 | 9.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 165208 | |
| E | 148826 | |
| A | 108456 | |
| N | 74665 | |
| L | 74413 | |
| M | 74161 | |
| U | 73638 | |
| I | 33520 | 4.3% |
| D | 16634 | 2.1% |
| R | 16634 | 2.1% |
| Other values (2) | 504 | 0.1% |
| Value | Count | Frequency (%) |
| ( | 91047 |
| Value | Count | Frequency (%) |
| ) | 91047 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 786659 | |
| Common | 182094 | 18.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| S | 165208 | |
| E | 148826 | |
| A | 108456 | |
| N | 74665 | |
| L | 74413 | |
| M | 74161 | |
| U | 73638 | |
| I | 33520 | 4.3% |
| D | 16634 | 2.1% |
| R | 16634 | 2.1% |
| Other values (2) | 504 | 0.1% |
| Value | Count | Frequency (%) |
| ( | 91047 | |
| ) | 91047 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 968753 |
Most frequent character per block
| Value | Count | Frequency (%) |
| S | 165208 | |
| E | 148826 | |
| A | 108456 | |
| ( | 91047 | |
| ) | 91047 | |
| N | 74665 | |
| L | 74413 | |
| M | 74161 | |
| U | 73638 | |
| I | 33520 | 3.5% |
| Other values (4) | 33772 | 3.5% |
cuotas
Real number (ℝ≥0)
| Distinct | 320 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.934308654 |
| Minimum | 0 |
|---|---|
| Maximum | 959 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 711.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 9 |
| 95-th percentile | 15 |
| Maximum | 959 |
| Range | 959 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 23.38240162 |
|---|---|
| Coefficient of variation (CV) | 2.946999246 |
| Kurtosis | 212.8847377 |
| Mean | 7.934308654 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 11.09327379 |
| Sum | 722395 |
| Variance | 546.7367056 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 40618 | 44.6% |
| 10 | 8973 | 9.9% |
| 5 | 8546 | 9.4% |
| 3 | 4197 | 4.6% |
| 2 | 4157 | 4.6% |
| 12 | 3881 | 4.3% |
| 6 | 3654 | 4.0% |
| 4 | 2897 | 3.2% |
| 14 | 2400 | 2.6% |
| 9 | 2365 | 2.6% |
| Other values (310) | 9359 | 10.3% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 40618 | 44.6% |
| 2 | 4157 | 4.6% |
| 3 | 4197 | 4.6% |
| 4 | 2897 | 3.2% |
| Value | Count | Frequency (%) |
| 959 | 1 | < 0.1% |
| 920 | 2 | < 0.1% |
| 909 | 1 | < 0.1% |
| 801 | 1 | < 0.1% |
| 546 | 1 | < 0.1% |
forma_pago
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 711.4 KiB |
| CRÉDITO | 69061 |
|---|---|
| CONTADO | 19522 |
| LIBRANZA | 2464 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.027062946 |
| Min length | 7 |
Characters and Unicode
| Total characters | 639793 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CRÉDITO |
|---|---|
| 2nd row | CRÉDITO |
| 3rd row | CRÉDITO |
| 4th row | CRÉDITO |
| 5th row | CRÉDITO |
| Value | Count | Frequency (%) |
| CRÉDITO | 69061 | 75.9% |
| CONTADO | 19522 | 21.4% |
| LIBRANZA | 2464 | 2.7% |
| Value | Count | Frequency (%) |
| crédito | 69061 | 75.9% |
| contado | 19522 | 21.4% |
| libranza | 2464 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 108105 | |
| C | 88583 | |
| D | 88583 | |
| T | 88583 | |
| R | 71525 | |
| I | 71525 | |
| É | 69061 | |
| A | 24450 | 3.8% |
| N | 21986 | 3.4% |
| L | 2464 | 0.4% |
| Other values (2) | 4928 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 639793 |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 108105 | |
| C | 88583 | |
| D | 88583 | |
| T | 88583 | |
| R | 71525 | |
| I | 71525 | |
| É | 69061 | |
| A | 24450 | 3.8% |
| N | 21986 | 3.4% |
| L | 2464 | 0.4% |
| Other values (2) | 4928 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 639793 |
Most frequent character per script
| Value | Count | Frequency (%) |
| O | 108105 | |
| C | 88583 | |
| D | 88583 | |
| T | 88583 | |
| R | 71525 | |
| I | 71525 | |
| É | 69061 | |
| A | 24450 | 3.8% |
| N | 21986 | 3.4% |
| L | 2464 | 0.4% |
| Other values (2) | 4928 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 570732 | |
| None | 69061 | 10.8% |
Most frequent character per block
| Value | Count | Frequency (%) |
| O | 108105 | |
| C | 88583 | |
| D | 88583 | |
| T | 88583 | |
| R | 71525 | |
| I | 71525 | |
| A | 24450 | 4.3% |
| N | 21986 | 3.9% |
| L | 2464 | 0.4% |
| B | 2464 | 0.4% |
| Value | Count | Frequency (%) |
| É | 69061 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
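The definition above (covariance divided by the product of the standard deviations) can be illustrated with a short standard-library sketch. The function name `pearson_r` and the toy data are hypothetical and unrelated to this dataset.

```python
from statistics import mean, pstdev


def pearson_r(x, y):
    """Population covariance of x and y over the product of their std devs."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (pstdev(x) * pstdev(y))


# Toy data, chosen only to illustrate the properties described above.
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]

print(pearson_r(x, y))                        # exact positive linear relation
print(pearson_r(x, [3 * v + 7 for v in y]))   # shifted and rescaled y
print(pearson_r(x, [-v for v in y]))          # exact negative linear relation
```

Shifting or positively rescaling either variable leaves r unchanged, while negating one variable flips its sign.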
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
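As a rough illustration of the description above, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The helper names and the toy data below are hypothetical; ties receive the mean of the ranks they span, which is a common convention but an assumption here.

```python
from statistics import mean, pstdev


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry)) / len(rx)
    return cov / (pstdev(rx) * pstdev(ry))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but strongly nonlinear
print(spearman_rho(x, y))
```

Because only the ordering matters, the cubic relationship in the toy data still counts as a perfect monotonic association.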
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
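The pair-counting rule described above corresponds to the variant usually called tau-a. A minimal sketch follows; the function name `kendall_tau_a` and the toy data are hypothetical, and tied pairs are counted as neither concordant nor discordant. Library implementations often default to a tie-corrected variant (for example tau-b in SciPy's `kendalltau`), so results can differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        s = (x1 - x2) * (y1 - y2)
        if s > 0:
            concordant += 1   # both variables move in the same direction
        elif s < 0:
            discordant += 1   # the variables move in opposite directions
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Toy data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
print(kendall_tau_a(x, y))
```

The quadratic pair enumeration is fine for illustration but would be slow on a dataset of this size.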
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Row | tipo_identificacion | genero | estado_civil | edad | municipio_residencia | tipo_persona | empresa | municipio_nacimiento | municipio_expedicion | tiene_casa_propia | cargo | sueldo | tiempo_servicio | otros_ingresos_mensual | otros_ingresos_concepto | municipio_credito | codeudor | entidad | año_credito | mes_credito | valor_credito | estado_final | tipo_venta | periodo_credito | cuotas | forma_pago |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Cedula de Ciudadania | Femenino | Union Libre | 32 | ARAUQUITA | Natural | INDEPENDIENTE | ARAUQUITA | ARAUQUITA | Si | COMERCIANTE | 400000 | 2 AÑOS | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 1020000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | CRÉDITO |
| 1 | 2 | Cedula de Ciudadania | Femenino | Union Libre | 41 | ARAUQUITA | Natural | INDEPENDIENTE | ARAUQUITA | ARAUQUITA | Si | COMERCIANTE | 2000000 | 5 AÑOS | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 680000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 2 | 3 | Cedula de Ciudadania | Femenino | Soltero | 23 | ARAUQUITA | Natural | NaN | ARAUQUITA | ARAUQUITA | No | ESTILISTA | 700000 | 3 AÑOS | NaN | NaN | ARAUQUITA | CON CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 1155000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 3 | 4 | Cedula de Ciudadania | Femenino | Soltero | 39 | ARAUQUITA | Natural | FISCALIA | ARAUQUITA | ARAUQUITA | No | ASISTENTE DE FISCAL UNO | 4300000 | 3 AÑOS | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 2 | 3678000 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 4 | 5 | Cedula de Ciudadania | Femenino | Soltero | 65 | ARAUQUITA | Natural | INDEPENDIENTE | ARAUQUITA | ARAUQUITA | Si | RESTAURANTE DOÑA RITA | 1500000 | NaN | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 2 | 1920000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 5 | 6 | Cedula de Ciudadania | Femenino | Soltero | 55 | ARAUQUITA | Natural | SEAD ARAUCA | ARAUQUITA | ARAUQUITA | Si | DOCENTE | 3500000 | 28 AÑOS | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 2 | 515000 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 6 | 7 | Cedula de Ciudadania | Femenino | Union Libre | 35 | ARAUQUITA | Natural | SENA | ARAUQUITA | ARAUQUITA | Si | AUXILIAR DE ENFERMERIA | 1800000 | 1 AÑO | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 2 | 1050000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 1 | CRÉDITO |
| 7 | 8 | Cedula de Ciudadania | Femenino | Soltero | 46 | ARAUQUITA | Natural | INDEPENDIENTE | ARAUQUITA | ARAUQUITA | No | COMERCIANTE | 600000 | 10 AÑOS | NaN | NaN | ARAUQUITA | CON CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 1614000 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | CRÉDITO |
| 8 | 9 | Cedula de Ciudadania | Femenino | Casado | 29 | ARAUQUITA | Natural | INDEPENDIENTE | ARAUQUITA | ARAUQUITA | Si | INDEPENDIENTE | 1500000 | 5 AÑOS | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 860000 | DEVOLUCION | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
| 9 | 10 | Cedula de Ciudadania | Femenino | Soltero | 35 | ARAUQUITA | Natural | FEDECACAO | ARAUQUITA | ARAUQUITA | No | TECNICA DE CAMPO | 950000 | NaN | NaN | NaN | ARAUQUITA | SIN CODEUDOR | INDEP. ARAUQUITA | 2020 | 3 | 545000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 5 | CRÉDITO |
Last rows
| | Row | tipo_identificacion | genero | estado_civil | edad | municipio_residencia | tipo_persona | empresa | municipio_nacimiento | municipio_expedicion | tiene_casa_propia | cargo | sueldo | tiempo_servicio | otros_ingresos_mensual | otros_ingresos_concepto | municipio_credito | codeudor | entidad | año_credito | mes_credito | valor_credito | estado_final | tipo_venta | periodo_credito | cuotas | forma_pago |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 91037 | 91038 | Cedula de Ciudadania | Masculino | Union Libre | 49 | ARAUCA | Natural | INDEPENDIENTE | ARAUCA | ARAUCA | Si | OPERADOR MAQUINARIA PESADA | 5200000 | 15 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 1997 | 12 | 680400 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | MENSUAL(ES) | 12 | LIBRANZA |
| 91038 | 91039 | Cedula de Ciudadania | Masculino | Casado | 60 | ARAUCA | Natural | ALCALDIA DE ARAUCA | ARAUCA | ARAUCA | Si | BOMBEROS | 908526 | 32 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 2011 | 6 | 1912000 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | MENSUAL(ES) | 15 | CRÉDITO |
| 91039 | 91040 | Cedula de Ciudadania | Masculino | Casado | 60 | ARAUCA | Natural | ALCALDIA DE ARAUCA | ARAUCA | ARAUCA | Si | BOMBEROS | 908526 | 32 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 2010 | 2 | 1800000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 15 | CRÉDITO |
| 91040 | 91041 | Cedula de Ciudadania | Masculino | Casado | 60 | ARAUCA | Natural | ALCALDIA DE ARAUCA | ARAUCA | ARAUCA | Si | BOMBEROS | 908526 | 32 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 2008 | 7 | 1301000 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | MENSUAL(ES) | 17 | LIBRANZA |
| 91041 | 91042 | Cedula de Ciudadania | Masculino | Soltero | 47 | SARAVENA | Natural | ALCALDIA DE SARAVENA | SARAVENA | SARAVENA | Si | INSPECTOR DE IMPUESTOS | 1500000 | 2 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | NaN | 2009 | 3 | 899000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 6 | CRÉDITO |
| 91042 | 91043 | Cedula de Ciudadania | Masculino | Soltero | 35 | TAME | Natural | INDEPENDENCE | TAME | TAME | No | DICTA CAPACITACIONES | 850000 | 3 MESES | NaN | NaN | ARAUCA | CON CODEUDOR | SIN ENTIDAD | 2017 | 9 | 1350000 | PAGADO ANTICIPADO | ELECTRODOMESTICOS | MENSUAL(ES) | 1 | CRÉDITO |
| 91043 | 91044 | Cedula de Ciudadania | Masculino | Casado | 39 | ARAUCA | Natural | INDEPENDIENTE | ARAUCA | ARAUCA | No | COMUNICADOR EN CAMPO | 3500000 | 5 AÑOS | NaN | NaN | ARAUCA | SIN CODEUDOR | SIN ENTIDAD | 2017 | 11 | 3250000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 4 | CRÉDITO |
| 91044 | 91045 | Cedula de Ciudadania | Masculino | Union Libre | 31 | TAME | Natural | INDEPENDIENTE | TAME | TAME | No | DELICARNES DUEÑO | 1500000 | 2 AÑOS | NaN | NaN | TAME | SIN CODEUDOR | SIN ENTIDAD | 2017 | 12 | 3410000 | PAGADO VENCIDO | ELECTRODOMESTICOS | DIARIA(S) | 153 | CRÉDITO |
| 91045 | 91046 | Cedula de Ciudadania | Masculino | Casado | 51 | ARAUCA | Natural | CORPORINOQUIA | ARAUCA | ARAUCA | Si | PERIODISTA | 3800000 | 12 MESES | 0.0 | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 2008 | 4 | 2690000 | PAGADO VENCIDO | ELECTRODOMESTICOS | MENSUAL(ES) | 9 | LIBRANZA |
| 91046 | 91047 | Cedula de Ciudadania | Masculino | Casado | 51 | ARAUCA | Natural | CORPORINOQUIA | ARAUCA | ARAUCA | Si | PERIODISTA | 3800000 | 12 MESES | 0.0 | NaN | ARAUCA | SIN CODEUDOR | ALCALDIA DE ARAUCA | 2008 | 6 | 2628000 | DESCUENTO EN VENTA | ELECTRODOMESTICOS | MENSUAL(ES) | 11 | LIBRANZA |